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Abstract. The theory of linear dispersive equations predicts that waves should 
spread out and disperse over time. However, it is a remarkable phenomenon, 
observed both in theory and practice, that once nonlinear effects are taken 
into account, solitary wave or soliton solutions can be created, which can be 
stable enough to persist indefinitely. The construction of such solutions can 
be relatively straightforward, but the fact that they are stable requires some 
significant amounts of analysis to establish, in part due to symmetries in the 
equation (such as translation invariance) which create degeneracy in the sta- 
bility analysis. The theory is particularly difficult in the critical case in which 
the nonlinearity is at exactly the right power to potentially allow for a self- 
similar blowup. In this article we survey some of the highlights of this theory, 
from the more classical orbital stability analysis of Weinstein and Grillakis- 
Shatah-Strauss, to the more recent asymptotic stability and blowup analysis of 
Martel-Merle and Merle- Raphael, as well as current developments in using this 
theory to rigorously demonstrate controlled blowup for several key equations. 



1. Introduction 

In these notes we shall eventually describe recent developments in the stability 
theory of solitons (or more precisely, solitary waves). Before we discuss solitons, 
however, we need to first describe the wider context of dispersive equations, and 
why even the very existence of solitons were initially such a surprising phenomenon. 

In classical physics, it has been realised for centuries that the behaviour of ide- 
alised vibrating media (such as waves on string, on the surface of a body of water, 
or in air), in the absence of friction or other dissipative forces, can be modeled by a 
number of partial differential equations, known collectively as dispersive equations. 
Model examples of such equations include the following: 

• The free wave equation 

u tt - c 2 Au = 0, 

where u : R x R d — > R represents the amplitude u(t, x) of a wave at 
a point in a spacetime with d spatial dimensions, A := 2_/j=i ~§^s 1S the 

spatial Laplacian on R d , utt is short for ^p-, and c > is a fixed constant 
(which can be rescaled to equal 1 if one wishes). This equation models the 
evolution of waves in a medium which has a fixed speed c of propagation 
in all directions. 
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(1.1) 



• The linear (time- dependent) Schrddinger equation 

h 2 

ihut H Aii = Vu 

2m 



where a:Rx R d — * C is the wave function of a quantum particle, H, m > 
are physical constants (which can be rescaled to equal 1 if one wishes) , and 
V : R d — ► R is a potential function, which we assume to depend only on 
the spatial variable x. This equation models the evolution of a quantum 
particle in space in the presence of a classical potential well V. 
• The nonlinear Schrddinger (NLS) equation 



where p > 1 is an exponent and \i = ±1 is a sign (the case /i = +1 is 
known as the defocusing case, and fx = — 1 as the focusing case). This 
equation can be viewed as a variant of the linear Schrodinger equation 
(with the constants h and m normalised away), in which the potential V 
now depends in a nonlinear fashion on the solution itself. This equation no 
longer has a physical interpretation as the evolution of a quantum particle, 
but can be derived as a model for quantum media such as Bose-Einstein 
condensates (see e.g. [73]). 

• The (time- dependent) Airy equation 

(1.3) u t +u xxx = 

where u:RxR->Risa scalar function. This equation can be derived 
as a very simplified model for propagation of low amplitude water waves 
in a shallow canal, by starting with the Euler equations, making a number 
of simplifying assumptions to discard nonlinear terms, and the normalising 
all constants to equal 1. 

• The Korteweg-de Vries (KdV) equation [32] 

(1.4) u t + u xxx + 6uu x = 

which is a more refined version of the Airy equation in which the first 
nonlinear term is retained. The constant 6 that appears here is not essential, 
but turns out to be convenient when connecting this equation to the theory 
of inverse scattering (of which more will be said later). 

• The generalised Korteweg-de Vries (gKdV) equation 

(1.5) u t + u xxx + (u p ) x = 

for p > 1 an integer; the case p — 2 is essentially the KdV equation, and the 
case p = 3 is known as the modified Korteweg-de Vries (mKdV) equation. 
The case p — 5 is particularly interesting due to its mass-critical nature, 
which we will discuss later. 

For simplicity (to avoid some non-trivial topological phenomena, which are an 
interesting topic which we have no space to discuss here) we shall only consider 
equations on flat spacetimes R™, and only consider solutions which decay to zero 
at spatial infinity. 

The above equations are all evolution equations; if we specify the initial position 
w(0, x) = uq{x) of a wave at time t ~ 0, we expect these equations to have a 



(1.2) 



iu t + Ait = n\u\ 



u 
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unique solution with that initial dat£0 for all future times t > 0. Actually, all of 
the above equations are time-reversible (for instance, if (t, x) i— > u(t,x) solves (|1.4|) . 
then (t, x) i — ^ u(—t, —x) also solves (|1.4[1 ) so we also expect the initial data at time 
t = to determine the solution at all past times t < 0. (This is in sharp contrast 
to dissipative equations such as the heat equation w t = Ait, which are solvable 
forward in time but are not solvable backwards in time, at least in the category of 
smooth functions, due to an irreversible loss of energy and information inherent in 
this equation as time moves forward.) 

Solutions to the above equations have two properties, which may seem at first 
to contradict each other. The first property is that all of these equations are 
conservative] there exists a Hamiltonian v i— > H(v), which is a functional that 
assigns a real number H(v) to any (sufficiently smooth and decaying) function v 
on the spatial domain R d , such that the HamiltoniarJl H(u(t)) of a (sufficiently 
smooth and decaying) solution u(t) to the above equation is conserved in time: 

H(u(t)) = H(u(0)), or equivalently d t H(u(t)) = 0. 

More specifically, the Hamiltonian is given by 

H(u(t),u t (t)) := / hut(t,x)\ 3 + -\V x u(t,x)\ 3 dx 
J R d A A 

for the wave equation, 

H(u(t)):= ( ^-\\7 x u(t,x)\ 2 + V(x)\u{t,x)\ 2 dx 

for the linear Schrodinger equation, 

H(u(t)) := f i|V a u(t, x)\ 2 + -£r|u(t, x)Y> +1 dx 
Jr" 2 p+1 

for the nonlinear Schrodinger equation, 

H(u(t)) := / u x (t,xf dx 

for the Airy equation, 

(1.6) H(u(t)):= / u x (t,x) 2 - 2u(t,x) 3 dx 
for the Korteweg-de Vries equation, and 

(1.7) H(u(t)):= { uM,x) 2 — u(t, x) p+1 dx 

Jr P+ 1 

for the generalised Korteweg-de Vries equation. In all of these cases, the conser- 
vation of the Hamiltonian can be formally verified by computing dtH(u(i)) via 

^For the wave equation, which is second-order in time, we also need to specify the initial 
velocity dtu(0,x) = u\(x). 

2 To simplify the exposition, we shall largely ignore the important, but technical, analytic issues 
of exactly how much regularity and decay one needs in order to justify all the computations and 
assertions given here. In practice, one usually first works in the category of classical solutions - 
solutions that are smooth and rapidly decreasing - and then uses rigorous limiting arguments (and 
in particular, exploiting the low-regularity well-posedness theory of these equations) to extend all 
results to more general classes of solutions, such as solutions in the energy space H 1 (R d ). See 
e.g. [9], [29] . |79j for further discussion. 

■^Again, with the wave equation, the Hamiltonian depends on the instantaneous velocity ut(t) 
of the solution at time t as well as the instantaneous position u(t). 



4 



TERENCE TAO 



differentiation under the integral sign, substituting the evolution equation for u, 
and then integrating by parts; we leave this to the reader as an exercistQ. 

Actually, the Hamiltonian is not the only conserved quantity available for these 
equations; each of these equations also enjoy a number of symmetries (e.g. trans- 
lation invariance), which (by Noether's theorem) leads to a number of important 
additional conserved quantities, which we will discuss later. It is often helpful to 
interpret the Hamiltonian as describing the total energy of the wave. 

The conservative nature of these equations means that even for very late times 
t, the state u(t) of solution is stilQ "similar" to the initial state u(0), in the sense 
that they have the same energy. This may lead one to conclude that solutions to 
these evolution equations should evolve in a fairly static, or fairly periodic manner; 
after all, this is what happens to solutions to finite-dimensional systems of ordinary 
differential equations which have a conserved energy H which is coercive in the 
sense that the energy surfaces {H = const} are always bounded. 

However, this intuition turns out to not be correct in the realm of dispersive 
equations, even though such equations can be thought of as infinite-dimensional 
systems of ODE with a conserved energy, and even though this energy usually ex- 
hibits coercive properties. This is ultimately because of a second property of all 
of these equations, namely dispersion. Informally, dispersion means that different 
components of a solution u to any of these equations travel at different velocities, 
with the velocity of each component determined by the frequency. As a conse- 
quence, even though the state of solution at late times has the same energy as the 
initial state, the different components of the solution are often so dispersed that the 
solution at late times tends to have much smaller amplitudes than at early timetQ. 
Thus, for instance, it is perfectly possible for the solution u(t) to go to zero in 
L°°(Il d ) norm as t — > ±00, even as its energy stays constant (and non-zero). The 
ability to go to zero as measured in one norm, while staying bounded away from 
zero in another, is a feature of systems with infinitely many degrees of freedom, 
which is not present when considering systems of ODE with only boundedly many 
degrees of freedom. 

One can see this dispersive effect in a number of ways. One (somewhat informal) 
way is to analyse plane wave solutions 

(1.8) u(t, x) = Ae ltT+x< 

for some non-zero amplitude A, some temporal frequency t£R, and some spatial 
frequency £s R d . For instance, for the Airy equation (jl.3p . one easily verifies that 
(|1.8p solve^j (|1.3j) exactly when r = £ 3 ; this equation is known as the dispersion 
relation for the Airy equation. If we rewrite the right-hand side of ()1 .8(1 in this case 

4 One can also formally establish conservation of the Hamiltonian by interpreting each of the 
above dispersive equations in turn as an infinite-dimensional Hamiltonian system, but we will not 
adopt this (important) perspective here; see 38 for further discussion. 

^This is assuming that the solution exists all the way up to this time t, which can be a difficult 
task to establish rigorously, especially if the initial data was rough. Again, we suppress these 
important technical issues for simplicity. 

^This phenomenon may seem to be inconsistent with time reversal symmetry. However, this 
dispersive effect only occurs when the initial data is spatially localised; dispersion sends localised 
high-amplitude states to broadly dispersed, low-amplitude states, but (by time reversal) can also 
have the reverse effect. 

^Strictly speaking, one needs to allow solutions u to 11.31 to be complex-valued here rather 
than real-valued, but this is of course a minor change. 
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as Ae l ^ x ~(~^ , this asserts that a plane wave solution to (11. 3|) has a phase velocity 
— £ 2 which is the negative square of its spatial frequency £. Thus we see that for 
this equation, higher frequency plane waves have a much faster phase velocity than 
lower frequency ones, and the velocity is always in a leftward direction. Similar 
analyses can be carried out for the other equations given above, though in those 
equations involving a nonlinearity or a potential, one has to restrict attention to 
small amplitude or high frequency solutions so that one can (non-rigorously) neglect 
the effect of these terms. For instance, for the Schrodinger equation (jl.ip (at least 
with V = 0) one has the dispersion relation 

(1.9) ""^Ifl 2 - 

However, as is well known in physics, the phase velocity does not determine the 
speed of propagation of information in a system; that quality is instead controlled 
by the group velocity, which typically is slightly different from the phase velocity. 
To explain this quantity, let us modify the ansatz (|1.8j) by allowing the amplitude 
A to vary slowly in space, and propagate in time at some velocity v 6 R d . More 
precisely, we consider solutions of the form 

u(t,x) « A(e(x ~ vt))e ltT+x -t 

where A is a smooth function and s > is a small parameter, and we shall be vague 
about what the symbol means. If we have r = £ 3 as before, then a short (and 
slightly non-rigorous) computation shows that 

u t + u xxx » e(v + 2,e)A'(e(x - vt))e ltT+x < + 0(e 2 ). 

Thus we see that in order for u to (approximately) solve (|1.3|) up to errors of 0(e 2 ), 
the group velocity v must be equal to — 3£ 2 , which is three times the phase velocity 
— £ 2 . Thus, at a qualitative level at least, we still have the same predicted behaviour 
as before; all frequencies propagate leftward, and higher frequencies propagate faster 
than lower ones. In particular we expect localised high-amplitude states, which can 
be viewed (via the Fourier inversion formula) as linear superpositions of plane waves 
of many different frequencies, to disperse leftwards over time into broader, lower- 
amplitude states (but still with the same energy as the original state, of course). 

One can perform similar analyses for other equations. For instance, for the linear 
Schrodinger equation, and assuming either high frequencies or small potential, one 
expects waves to propagate at a velocity proportional to their frequency, according 
to de Broglie's law mv = H£,; similarly for the nonlinear Schrodinger equation when 
one assumes either high frequencies or small amplitude. In contrast, for wave equa- 
tion, this analysis suggests that waves of (non-zero) frequency £ should propagate 
at velocities j|yc; thus the propagation speed c is constant but the propagation 

direction j|t varies with frequency, leading to a weak dispersive effect. For more 
general dispersive equations, the group velocity can be read off of the dispersion 
relation r = t(£) by the formula v = — Vjt (whereas in contrast, the phase velocity 
is -^r). 

In the case of the Schrodinger equation with V = 0, one can see the dispersive 
effort more directly by using the explicit solution formula 

1 
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for t ^ and all sufficiently smooth and decaying initial data uq- Indeed, we 
immediately conclude from this formula (formally, at least) that if uq is absolutely 
integrable, then ||u(t)||^oo( R d) decays at the rate 0(|i|~ ti//2 ). 

In the case of linear equations such as the Airy equation (II. 3|) . there is a similar 
explicit formula (involving the Airy function Ai(x) instead of complex exponentials), 
but one can avoid the use of special functions by instead proceeding using the 
Fourier transform and the principle of stationary phase (see e.g. [75]). Indeed, by 
starting with the Fourier inversion formula 

u (x) = f u (Oe lxi <% 

JR 

where Uo(£) := f-R u o( x ) e ~~ lx ^ dx is the Fourier transform of uq, and noting as 
before that e lt € 3 + tx t is the solution of the Airy equation with initial data e lx ^, we see 
from the principle of superposition (and ignoring issues of interchanging derivatives 
and integrals, etc.) that the solution u is given by the formula 

(1.10) u(t,x)= [ u (x)e lt ^ +lx ^ d£. 

If uq is a Schwartz function (infinitely smooth, with all derivaives decreasing faster 
than any polynomial), then its Fourier transform is also Schwartz and thus slowly 
varying. On the other hand, as t increases, the phase e 1 *^ oscillates more and 
more rapidly (for non-zero £) , and so we expect an increasing amount of cancellation 
in the integral in (|1.10|) . leading to decay of u as t — > 00. This intuition can be 
formalised using the methods of stationary phase (which can be viewed as advanced 
applications of the undergraduate calculus tools of integration by parts and changes 
of variable), and can for instance be used to show that ||u(£)||l°°(r) decays at a 
rate 0(i -1 / 3 ) in general. 

This technique of representing a solution as a superposition of plane waves also 
works (with a twist) for the linear Schrodinger equation (jl.ip in the presence of a 
potential V, provided that the potential is sufficiently smooth and decaying. The 
basic idea is to replace the plane waves (|1.8p by distorted plane waves $(r, x)e ltT 7 
where (in order to solve (|l.l[l ) <f> has to solve the time-independent Schrodinger 
equation 

h 2 

(1.11) -Th$ + A$ = V$, 

2m 

and then to try to represent solutions u to (jl.ip as superpositions 

where we are being intentionally vague as to what the range of integration is. If 
we restrict attention to negative values of r, then it turns out (by use of scattering 
theory) that we can construct distorted plane waves $(r, x) which asymptotically 
resemble the standard plane waves e lx '^ as \x\ — > 00, where £ is a frequency obeying 
the dispersion relation (|1.9jl . If u is composed entirely of these waves, then one has 
a similar dispersive behaviour to the free Schrodinger equation (for instance, under 
suitable regularity and decay hypotheses on V and uq, \\u(t)\\ L oa ^ R d^ will continue 
to decay like 0(t~ d / 2 )). In such cases we say that u is in a radiating state. In many 
important cases (such as when the potential V is non-negative, or is small in certain 
function space norms), all states (with suitable regularity and decay hypotheses) 
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are radiating states. However, when V is large and allowed to be negative, it is 
also possiblqj to contain bound states, in which r is positive, and the distorted 
plane wave 3>(r, x) is replaced by an eigenfunction $, which continues to solve 
the equation (jl.lip , but now <!> decays exponentially to zero as |x| — > oo, instead 
of oscillating like a plane wave as before. (Informally, this is because once r is 
positive, the dispersion relation (|1.9p is forcing £ to be imaginary rather than real.) 
In particular, $ lies in L 2 (R d ), and so — r becomes an eigenvalue of the Schrodinger 

operatoiQ H := — -J^-A+V. Because multiplication V is a compact operator relative 

r 2 

to A, standard spectral theory shows that the set of eigenvalues — r is discrete 
(except possibly at the origin — r = 0). Note that it is necessary for V to take on 
negative values in order to obtain negative eigenvalues, since otherwise the operator 
H is positive semi-definite. 

If i*o consists of a superposition of one or more of these eigenfunctions, e.g. 

= y^Cfc ( l ) (rfc,x) 

k 

where — Tk ranges over finitely many of the eigenvalues of H, then we formally have 

«(t)=^c fe e i4rfc $(r fe) x) ) 

k 

and so we see that u(t) oscillates in time but does not disperse in space. In this 
case we say that u is a bound state. Indeed, the evolution is instead almost periodic, 
in the sense that liminft^oo \\u(t) — wo||L 2 (R. d ) = 0, or equivalently that the orbit 
{u(t) : t S R} is a precompact subset of L 2 (R d ). 

By further application of spectral theory, one can show that an arbitrary state 
uq (in, say, L 2 (R d )) can be decomposed as the orthogonal sum of a radiating 
state, which disperses as t — > oo, and a bound state, which evolves in an almost 
periodic manner. Indeed this decomposition corresponds to the decomposition of 
the spectral measure of H into absolutely continuous and pure point components. 

2. SOLITONS 

We have seen how solutions to linear dispersive equations either disperse com- 
pletely as t — > oo, or else (in the presence of an external potential) decompose into 
a superposition of a radiative state that disperses to zero, plus a bound state that 
exhibits phase oscillation but is otherwise stationary. 

In everyday physical experience with water waves, we of course see that such 
waves disperse to zero over time; once a rock is thrown into a pond, for instance, 
the amplitude of the resulting waves diminish over time. However, one does see in 
nature water waves which refuse to disperse for astonishingly long periods of time, 
instead moving at a constant speed without change in shape. Such solitary waves 



"When V does not decay rapidly, then there can also be some intermediate states involving the 
singular continuous spectrum of the Schrodinger operator — + V , which disperse over time 

slower than the radiating states but faster than the bound states. One can also occasionally have 
resonances corresponding to the boundary case r = 0, which exhibit somewhat similar behaviour. 
For simplicity of exposition, we will not discuss these (important) phenomena. 

^This operator H is related to the Hamiltonian H(u) discussed earlier by the formula H (u) = 
(Hu,u), where (u,v) := f Rd uv is the usual inner product on L 2 (R d ). 
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or solitons^^ were first observed by John Scott Russell, who followed such a wave in 
a shallow canal on horseback for over a mile, and then reproduced such a traveling 
wave (which he called a "wave of translation" ) in a wave tank. 

This phenomenon was first explained mathematically by Korteweg and de Vries 
[32] in 1895, using the equation (|1.4p that now bears their name (although this 
equation was first proposed as a model for shallow wave propagation by Boussinesq 
a few decades earlier) . Indeed, if one considers traveling wave solutions to (|1.4p of 
the form 

u(t, x) — f(x — ct) 

for some velocity c, then this will be a solution to (| 1 .4(1 as long as / solves the ODE 

-cf + f" + 6//' = 0. 

If we assume that / decays at infinity, then we can integrate this third-order ODE 
to obtain a second-order ODE 

-cf + f" + 3/ 2 = 0. 

For c > 0, this ODE admits the localised explicit solutions f(x) = cQ(c 1 / 2 (x — xo)) 
for any xq € R, where Q is the explicit Schwartz function Q(x) := i sech 2 (|). For 
c < 0, one can show that there are no localised solutions other than the trivial 
solution / = 0. Thus we obtain a family of explicit solitary wave solutions 

(2.1) u(t,x) = cQ(c Y l 2 (x - ct- x )) 

to the KdV equation; the parameter c thus controls the speed, amplitude, and 
width of the wave, while x$ determines the initial location. 

Interestingly, all the solutions (|2.1[) move to the right, while radiating states move 
to the left. This phenomenon is somewhat analogous to the situation with the linear 
Schrodinger equation, in which the temporal frequency t (which is somewhat like 
the propagation speed c in KdV) was negative for radiating states and positive 
for bound states. Similar solitary wave solutions can also be found for gKdV and 
NLS equations, though in higher dimensions d > 1 one cannot hope to obtain such 
explicit formulae for these solutions, and instead one needs to use more modern 
PDE tools, such as calculus of variations and other elliptic theory methods, in 
order to build such solutions; see e.g. [3], 0], [21]. There are also larger and 
more oscillatory "excited" solitary wave solutions which, unlike the "ground state" 
solitary wave solutions described above, exhibit changes of sign, but we will not 
discuss them here. 

Early numerical analyses of the KdV equation 86j, [TH] revealed that these soliton 
solutions (|2.1[) were remarkably stable. Firstly, if one perturbed a soliton by adding 
a small amount of noise, then the noise would soon radiate away from the soliton, 
leaving the soliton largely unchanged (other than some slight perturbation in the 
c and xq parameters); these phenomena are described mathematically by results 
on the orbital stability and asymptotic stability of solitons, of which more will be 

^Strictly speaking, a wave which is localised and maintains its form for long periods of time is 
merely a soilitary wave. A soliton is a solitary wave with the additional property that solitons and 
other radiation can pass through it without destroying its form. The question of understanding 
the collision between two solitary waves for non-integrable equations is still poorly understood 
despite some recent progress, so we shall focus here instead on the more perturbative question of 
how solitary waves interact with small amounts of radiation. In the literature on that subject, it 
is then customary to refer to the solitary wave as a soliton, though this is technically an abuse of 
notation. 
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said later. This is perhaps unsurprising, given that solitons move rightwards and 
radiation moves leftwards, but one has to bear in mind that equations such as (|1.4j) 
are not linear, and in particular one cannot obviously superimpose a soliton and a 
radiative state to create a new solution to the KdV equation. 

What was even more surprising was what happened if one considered collisions 
between two solitons, for instance imagining initial data of the form 

u(0, x) = c\Q(c^ 2 (x — xi)) + c 2 Q{c 2 2 {x - x 2 )) 

with < c 2 < Ci and X\ far to the left of x 2 \ thus initially we have a larger, fast- 
moving soliton to the left of a shallower, slow-moving soliton. If the KdV equation 
were linear, the solution would now take the form 

u(t, x) = ciQ{c\l 2 {x - c\t - xi)) + c 2 Q{c 2 2 {x - c 2 t - x 2 )) 

and so the faster solitons would simply overtake the slower one, with no interaction 
between the two. At the other extreme, with a strongly nonlinear equation, one 
could imagine all sorts of scenarios when two solitons collide, for instance that they 
scatter into radiation or into many smaller solitons, combine into a large soliton, 
and so forth. However, the KdV equation exhibits an interesting intermediate 
behaviour: the solitons do interact nonlinearly with each other during collision, 
but then emerge from that collision almost unchanged, except that the solitons 
have been shifted slightly by their collision. In other words, for very late times i, 
the solution approximately takes the form 

u(t, x) w ciQ{c\^ 2 {x — c\t — x\ — #1)) + c 2 Q{c 1 2 2 {x — c 2 t — x 2 — 6 2 )) 

for some additional shift parameters 0\ , 2 € R. 

More generally, if one starts with arbitrary (but smooth and decaying) initial 
data, what usually happens (numerically, at least) with evolutions of equations 
such as (jl.4p is that some non-linear (and chaotic-seeming) behaviour happens for 
a while, but eventually most of the solution radiates away to infinity and a finite 
number of solitons emerge, moving away from each other at different rates. Quite 
remarkably, this behaviour can in fact be justified rigorously for the KdV equation 
and a handful of other equations (such as the NLS equation in the cubic one- 
dimensional case d = l,p = 3) due to the inverse scattering method, which we shall 
discuss shortly, although even in those cases, there are some exotic solutions, such 
as "breather" solutions, which occasionally arise and which do not evolve to a su- 
perposition of solitons and radiation, but instead exhibit periodic or almost periodic 
behaviour in time. Nevertheless, it is widely believed (and supported by extensive 
numerics) that for many other dispersive equations (roughly speaking, those equa- 
tions whose nonlinearity is not strong enough to cause finite time blowup, and more 
precisely for the subcritical equations), solutions with "generic" initial data should 
eventually resolve into a finite number of solitons, moving at different speeds, plus a 
radiative term which goes to zero. This (rather vaguely defined) conjecture goes by 
the name of the soliton resolution conjecture. Except for those few equations which 
admit exact solutions (for instance, by inverse scattering methods), the conjecture 
remains unsolved in general, in part because we have very few tools available that 
can say anything meaningful about generic data in a certain class (e.g. with some 
function norm bounds) without also being applicable to all data in that class; thus 
the presence of a few exotic solutions that do not resolve into solitons and radi- 
ation seems to prevent us from tackling all the other cases. Nevertheless, there 
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are certain important regimes in which we do have a good understanding. One of 
these is the perturbative regime near a single soliton, in which the initial state uo 
is close to that of a soliton such as (|2.ip ; this case will be the main topic of our 
discussion. More recently, results have begun to emerge on multisoliton states, in 
which the solution is close to the superposition of many widely separated solitons, 
and even more recently still, there has been some results on the collision between 
a very fast narrow soliton and a very slow broad one. However, it seems that truly 
non-perturbative regimes, such as the collisions between two solitons of comparable 
size, remain beyond the reach of current tools (perhaps requiring a new advance in 
our understanding of dynamical systems in general). (See [77], [76], [78] for further 
discussion.) 

3. The inverse scattering approach 

We now briefly mention the technique of inverse scattering, which is a non- 
perturbative approach which allows one to control the evolution of solutions to 
completely integrable equations such as (|1.4[) . (Similar techniques apply to one- 
dimensional cubic NLS, see e.g. [T], [73], [87], [67].) This is a vast subject, which 
can be viewed from many different algebraic and geometric perspectives; we shall 
content ourselves with describing the approach based on Lax pairs [41] . which has 
the advantage of simplicity, provided that one is willing to accept a rather miracu- 
lous algebraic identity. 

The identity in question is as follows. Suppose that u solves the KdV equation 
(|1 .4|) . As always we assume enough smoothness and decay to justify the compu- 
tations that follow. For every time t, we consider the time-dependent differential 
operators L(t), P(t) acting on functions on the real line R, defined by 

ax° 4 ax ax 

where we view u(t) as a multiplication operator, / i— ► u(t)f. One can view P(t) 
as a truncated (non-commutative) Taylor expansion of L(t) 3 ^ 2 . In view with this 
interpretation, it is perhaps not so surprising that L(t) and P{t) "almost commute" ; 
the commutator [P(t),L(t)] := P(t)L(t) - L{t)P(t) of the third order operator P(t) 
and the second order operator L(t) would normally be expected to be fourth order, 
but in fact things collapse to just be zeroth order. Indeed, after some computation, 
one eventually obtains 

[P(t),L(t)} = 

. \^XXX 

(t) + 6u(t)u x (t)). 

In particular, if we substitute in (|1.4[) , we obtain the remarkable Lax pair equation 
(3.1) j t L{t)=A[P{t),L{t)\- 

If we non-rigorously treat the operators L(t), P(t) as if they were matrices, we can 
interpret this equation as follows. Using the Newton approximation 

L(t + dt) m L(t) + dt—L(t); cxp(±P(t)dt) sa (1 ± P(t)dt) 
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for infinitesimal dt, we see from (|3.1jl that 

(3.2) L(t + dt) w exp(4P(t)db)L(t) exp(-4P(t)dt). 

This informal analysis suggests that Lit + dt) is a conjugate of Lit), and so on 
iterating this we expect L(t) to be a conjugate of L(0). In particular, the spectrum 
of L(t) should be time- invariant. Since L(t) is determined by u(t), this leads to a 
rich source of invariants for u(t). 

The above analysis can be made more rigorous. For instance, one can show 
that the traced tv(e sL ^) of heat kernels are independent of t for any fixed s > 0; 
expanding those traces in powers of s one can recover an infinite number of conser- 
vation laws, which includes the conservation of the Hamiltonian (|1.6j) as a special 
case. We will not pursue this approach further here, and refer the reader to [25j . 
Another way to proceed is to consider solutions to the generalised eigenfunction 
equation 

(3.3) L(t)<f>(t,x) = T<f>(t,x) 

for some r S R and some smooth function cf>(t, x) = <fi(t, x; r) (not necessarily 
decaying at infinity). If the equation (|3 . 3|) holds for a single time t (e.g. t = 0), 
and if <fi then evolves by the equation 

(3.4) c(> t (t,x) = 4:P(t)(t>(t,x) 

for all t, one can verify (formally, at least) that (|3.3p persists for all t, by differenti- 
ating (|3.3p in time and substituting in (|3.1[) and (|3.4j) . (The astute reader will note 
that these manipulations are equivalent to those used to produce (|3.2[) . 

This now suggests an strategy to solve the KdV equation exactly from an arbi- 
trary choice of initial data it(0) = uq. 

(1) Use the initial data uq to form the operator L(Q), and then locate the 
generalised eigenfunctions 0(0, x; A) for each choice of spectral parameter 

T. 

(2) Evolve each generalised eigenfunction </> in time by the equation (I3.4j) . 

(3) Use the generalised eigenfunctions 4>it, x;t) to recover L(t) and u(t). 

This strategy looks very difficult to execute, because the operator Pit) itself 
depends on u(t), and so (|3.4| cannot be solved exactly without knowing what u(t) 
is - which is exactly what we are trying to find in the first place! But we can break 
this circularity by only seeking to solve (|3.4|) at spatial infinity x = ±oo. Indeed, if 
u(t) is decaying, and r = — £ 2 for some real number £, then we see that solutions 
(pit, x) to (|3 . 3[) must take the form 

0(t,aO«o±(£;t)e** + & ± (£;t)e-«* 

as x — > ±oo, for some quantities a±(£;t), &±(£; t) € C, which we shall refer to as 
the scattering data of Lit). (One can normalise, say, a-(0) = 1 and 6_(£;0) = 0, 
and focus primarily on a+(£;i) and £>+(£;£), if desired.) Applying (|3.4|) and using 
the decay of w(£) once again, we are then led (formally, at least) to the asymptotic 
equations 

dta±(£;t) =4(i£) 3 a±(£;*); StMfct) = 4(-i£) 3 &±(£;*) 



^Actually, to avoid divergences we need to consider normalised traces, such as tr(e si (*' — 

8 d2 
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This only handles the case of negative energies A < 0. For positive energies, say 
A = +£ 2 for some £ > 0, the situation is somewhat similar; in this case, we have 
a discrete set of £ for which we have a decaying solution <p(t,x), with <p(t, x) ~ 
c±(t;;t)e^ x for x — > ±oo, where 



This suggests a revised strategy to solve the KdV equation exactly: 

(1) Use the initial data uq to form the operator L(Q), and then locate the 
scattering data a±(£;0), 6±(£;0), c±(£;0). 

(2) Evolve the scattering data by the equations (|3.5[) . (|3.6|) . 

(3) Use the scattering data at t to recover L(t) and u(t). 

The main difficulty in this strategy is now the third step, in which one needs 
to solve the inverse scattering problem to recover u(t) from the scattering data. 
This is a vast and interesting topic in its own right, and involves complex-analytic 
problems such as the Riemann-Hilbert problem; we will not discuss it further here 
(but see e.g. |20], [T], [HI]). Suffice to say, though, that after some work, it is 
possible to execute the above strategy for sufficiently smooth and decaying initial 
data u to obtain what is essentialljo an explicit formula for u. 

The relationship of all this to solitons is as follows. Recall from our discussion of 
the linear Schrodinger equation (jl.ip that the operator L(0) = 4^ + uo is going to 
have radiating states (or absolutely continuous spectrum) corresponding to negative 
energies r = — £ 2 < 0, and a discrete set of positive eigenfunctions corresponding 
to positive energies t = +£ 2 > 0. Generically, the eigenvalues are simpltQ. In 
that case, it turns out that the inverse scattering procedure relates each eigenvalue 
+£ 2 of L(0) to a soliton present inside uq\ the value of £ determines the scaling 
parameter c of the soliton, and the scattering data c±(£; 0) determines (in a slightly 
complicated fashion, depending on the rest of the spectrum) the location of the 
solitons. The remaining scattering data a±(£;0), &±(£;0) determines the radiative 
portion of the solution. As the solution evolves, the spectrum stays constant, but 
the data a±,b±,c± changes in a controlled manner; this is what causes the solitons 
to move and the radiation to scatter. It turns out that the exact location of each 
soliton depends to some extent on the relative sizes of the constants c±, which are 
growing or decaying exponentially at differing rates; it is because of this that as 
one soliton overtakes another, the location of each soliton gets shifted slightly. 



12 Note the resemblance of the phases here to those in (II.IOH , This is not a co-incidence, and 
indeed the scattering and inverse scattering transforms can be viewed as nonlinear versions of the 
Fourier and inverse Fourier transform. 

^The solution is not quite expressible as a closed- form integral, as in 11.101 , but can be built 
out of solving a number of ordinary differential-integral equations (such as the Gelfand-Levitan- 
Marchenko equation, see e.g. |160 . which turns out to suffice for the purposes of analysing the 
asymptotic behaviour of the solution. 

14 Repeated eigenvalues lead to more complicated behaviour, including breather solutions and 
logarithmically divergent solitons. 



(3.6) 



c± (£ ; t) = e ^ 3 * c± (£;0). 
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4. The analytic approach 

The inverse scattering method gives extremely powerful and precise information 
on very general (and in particular, non-perturbative) solutions to equations such as 
the Korteweg-de Vries equation. However, it does not seem to be directly applicable 
to more general equations, such as the gKdV equation (|1.5|) foiT"^ p / 2,3. For 
instance, no reasonable Lax pair formulation exists for these equations. We now 
turn to more analytic techniques, which are less sensitive to the fine algebraic 
structure of the equation, although they still do rely very heavily on conservation 
laws and their relatives, such as monotonicity formulae. 

We shall mostly restrict attention to the gKdV equation (|1.5[) . We have al- 
ready identified one conserved quantity for this equation, namely the energy (|1.7)l . 
Another such conserved quantity is the mass 

M(u(t)) := / u(t,x) 2 dx. 
Jr. 

Together, the mass and energy can (in some cases) control the H 1 norm 
II u (*)IIhi(r) := / u(t,x) 2 +u x (t,x) 2 dx. 

JR 

Indeed, if we are in the mass-subcritical case p < 5, then the Gagliardo-Nirenberg 
inequality 

(4.1) / v^<C(p)(f v 2 )^(f 

JR JR JR 

valid for any v with suitable decay and regularity, gives us the a priori bound 

(4.2) ||«(t)||jr;(H) < C(M(u(t)), H(u(t))) = C(M{u ), H(u )) 

for some quantity C{M(uq), H(uq)) depending on the initial mass or energy. The 
condition p < 5 is necessary to ensure that the exponent in (|4.1[) is strictly less 
than 1. This condition can also be deduced from scaling heuristics, by investigating 
how the mass and energy transform under the scale invariance 

(4.3) u(t,x)^ \-p^u(^,j). 

It is possible to use the a priori bound (14. 2)1 . combined with the Picard iteration 
method for constructing solutions, and some moderately advanced estimates from 
harmonic analysis, to show that the equation Ijl. 5j) in the mass-subcritical case 
admits a unique global smooth solutions from arbitrary smooth, decaying data; see 
(29) . Thus there is no problem with existence, uniqueness, or regularity when it 
comes to these equations; the only remaining analytic issue (albeit a difficult one) 
is to understand the asymptotic behaviour of solutions. 

The above analysis for p < 5 is valid no matter how large the mass M{u) = 
M(uq) of the solution is. If we then turn to the mass-critical case p = 5, the 
situation changes; the a priori bound (|4.2|) is now only valid when the mass M(uq) 
is sufficiently small. In fact, by using the sharp Gagliardo-Nirenberg inequality of 



"The modified KdV equation with p = 3 turns out to also be completely integrable; in fact, 
it can even be transformed directly into the KdV equation by a simple operation known as the 
Miura trans forro l64l . which wc will not discuss further here. 
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Weinstein(83j. one can be more precise as follows. Given any p > 1, the equation 
(|1.5|) admits a family of soliton (or solitary wave) solutions similar to (|2.1| , namely 

(4.4) u{t, x) = c 1 /( p - 1 )Q(c 1 / 2 (x - xo - ct)) 

where 



Q(x) 



\ i/(p-i) 
p+1 \ 



v 2cosh 2 (^x) y 
is a positive, smooth, rapidly decreasing solution to the ODE 

(4.5) Q x x + Q P = Q. 

(In fact, up to translation, Q is the only such solution; see [llj . |39j.) In particular, 
we have the standard soliton solution to gKdV 

(4.6) u(t,x)=Q(x-t); 

all other solitons differ from the standard soliton only by the scaling (14.3[) and the 
translation invariance. 

In the mass-critical case p = 5, all of the solitons have the same mass, namely 

M(u)=M(Q) = [ Q 2 . 

The Gagliardo-Nirenberg inequality of Weinstein can then be used to show that one 
has the a priori bound (|4.2p in the p — 5 case as long as one only considers solutions 
with mass strictly less than that of the ground state, and so long as the solution 
exists; see [53] ■ The latter caveat is a substantial one; it is conjectured that one has 
global existence of solutions for gKdV of the p — 5 from smooth decaying initial 
data whenever the mass is less than that of the ground state, but this conjecture 
is still open. (However, global existence is known if the mass is sufficiently small, 
by a perturbative argument based on the contraction mapping principle and some 
harmonic analysis estimates; see [25].) An important recent work of Martel and 
Merle [45] , [60] , [49] , [47] , though, shows in this case that singularities can form for 
data arbitrarily close to the ground state (but of slightly larger mass); we discuss 
this result in more detail in Section [5] 

In the mass-supercritical case p > 5 the situation is very unclear, due to the 
lack of good a priori etimates in this case; it is likely that singularities do form in 
this case for large initial data, but this has not been rigorously established. It is 
however known that solitons are unstable in this setting. 

We return now to the mass-subcritical case p < 5, in which global existence and 
regularity are assured, and now consider the problem of stability of one of the soliton 
solutions (|4.4| . By taking advantage of the scaling and translation invariance in the 
problem, we can reduce matters to considering the stability of the standard soliton 
Q(x — t). For instance, if we are given a solution u which is close to this soliton at 
time zero, for instance in the sense that 

(4.7) ||u(0) - Q|| H i (R ) < a 

for some sufficiently small a, is this enough to guarantee that u stays close to (I2.1[) 
for much later times, thus 

(4-8) \\u(t)-Q(--t))\\ mm <a' 

for some other small a 1 depending on er, and for large times t? (For small times t, 
the local well-posedness theory allows one to obtain bounds of this form, but with 
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a' replaced by Ccrexp(Ci) for some constant C depending only on p.) One can 
phrase this question in other norms than the H 1 norm, of course, but this norm 
turns out to be rather natural due to its connection with the Hamiltonian (which 
we have already seen in (|4.2p ). 

This type of absolute stability of the soliton is too strong a property to hold, 
basically because it is not compatible with the scale invariance (|4.3j) . Indeed, con- 
sider the soliton solution (|4.4|) with xq = and c = 1 + O{o) very close to 1. Then 
(|4.7|) holds, but (|4.8p will fail for sufficiently large times t, because u has most of 
its mass (and H 1 norm) near ct, whereas (|2.ip has most of its mass near ct. The 
point is that by rescaling the soliton very slightly, one can adjust the speed of that 
soliton, which will eventually over time cause the perturbed soliton to diverge from 
the original soliton. Note that this conclusion has nothing to do with the H 1 norm, 
and would work for basically any reasonable function space norm. 

However, even though this perturbed soliton is far from the original soliton at 
late times t, it is still close to a translation of that original soliton (by ct — t). 
Equivalently, if we define the ground state curve 

£ = {Q(--xo) :x eR}c H 1 (R) 

consisting of all translates of the ground statffH Q, then we see that u(t) stays close 
to £ for all t. To put it another way, while u(t) does not stay close to Q(- — t) for 
each time t, the orbit {u(t) : t £ R} stays close to the orbit {Q(- — i) : i £ R} = S. 
Indeed, this is a general phenomenon: 

Theorem 4.1 (Orbital stability of sub-critical gKdV). [2], [5], [84] Let I <p<5. 
If uq € -ff 1 (R) is such that distjyi (uq, S) is sufficiently small (say less than a for 
some small constant a > 0), and u is the solution to f)l .5(1 with initial data uq, then 
we have 

distjj 1 ( u (t), £) < distal (uq, E) 
for all t. Here we use X < Y or X = 0(Y) to denote the estimate \X\ < CY for 
some C that depends only on p, and X <~*>Y as shorthand for X < Y < X . 

This theorem is proven by a variant of the classical Lyapunov functional method 
for establishing absolute stability. Let us briefly recall how that method works. 
Suppose we were able to find a functional u i— > L(u) on H 1 with the following 
properties: 

(1) If u is an H 1 solution to (|1.5|) . then L(u(t)) is non-increasing in t. 

(2) Q is a local minimiser of L, thus L(u) — L(Q) > for all u sufficiently close 
to Q in H 1 . 

(3) Furthermore, the minimum is non-degenerate in the sense that L(u) — 
L(Q) ~ \\u — Q\\%i for all u sufficiently close to Q in H 1 . 

These three facts would then easily imply that Q is absolutely stable. Indeed, if 
uo is close to Q, then L(uo) is close to (but not smaller than) L(Q), which implies 
that L(u(t)) is also close to but not smaller than L(Q) for all t > 0, which implies 
(by a continuity argument) that u(t) is close to Q for all t > 0. (The case t < can 



More suggestively, one should think of E as the space of all possible soliton states whose 
conserved statistics (in particular, mass and energy) agree with that of the ground state. In the 
case of the sub-critical NLS equation 11.21 . E then becomes a cylinder, formed by considering the 
action of both translation Q(x) i— > Q(x — xo) and phase rotation Q(x) i— > e l ®Q(x) on Q. In the 
critical case, the dimension of E increases due to the additional symmetry of scale invariance, as 
we shall shortly see. 
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then be handled by time reversal symmetry u(t, x) i— > u(—t, —x), or equivalently by 
considering L(u(— •)) instead of L(u(-)).) 

We already saw, though, that Q is not absolutely stable, and so such a Lyapunov 
functional cannot exist. However, we can still hope to obtain a modified Lyapunov 
functional u i— > L(u) which implies orbital stability instead of absolute stability. 
More precisely, we require L to be such that 

(1) If u is an H 1 solution to (|1.5|) . then L(u(t)) is non-increasing in t. 

(2) L(u) = L(Q) for all «eS. 

(3) E is a local minimiser of L, thus — > for all u sufficiently close 
to E in H 1 . 

(4) Furthermore, the minimum is non-degenerate in the sense that L(u) — 
L(Q) ~ distjji(u, E) 2 for all u sufficiently close to E in H 1 . 

It is not hard to see that this would imply Theorem l4.11 The task then reduces to 
locating the functional L with the stated properties. From properties 2,4 it seems 
reasonable to look for an L which is translation invariant. From property 1 and 
time reversal symmetry it seems reasonable to look for an L which is conserved, 
such as a combination of the mass M(u) and the energy H(u). It also has to be 
a functional for which Q is a local minimum, thus (|4. 5[1 should essentially be the 
Euler-Lagrange equation for L. With all these heuristics, one is soon led to the 
candidate 

L{u) := H(u) + M(u). 

It is then not hard to verify most of the required properties for L, especially if we 
define Q to be the minimiser of L. The one tricky thing is to show the strict non- 
degeneracy L{u) — L(Q) > distal (u, E) 2 when u is close to E. One difficulty here is 
the translation invariance of the estimate; if we do not break this symmetry, then we 
are forced to only use translation-invariant methods to establish the estimate, which 
greatly reduces the range of tools available. Hence we shall break the symmetry by 
decomposing 

u = Q(- - xq) + e 

for some small functional e S H 1 (R) and some xq £ R. There are a number of ways 
we can choose the parameter xq. The most obvious approach is to pick Q(- — xq) to 
be the translated ground state which is closest to u in H 1 norm, thus minimising 
||e||jji. By elementary calculus, this allows us to obtain the orthogonality condition 

(e,Q'(- -x )) H i(R) = 

where (u,v) :— J R uv + u x v x is the standard inner product on H . Other 
choices of Xq will lead to a slightly different orthogonality condition; some orthog- 
onality conditions are more suitable for some applications than others, but we will 
not explore this technical issue further here. 

We can then break (or "spend" ) the translation invariance by normalising xq to 
be zero, thus u = Q + e and e is orthogonal to Q' . (Note that Q' represents the 
infinitesimal action of the translation group at Q.) Now, since Q is a minimiser of 
L we have (formally, at least) the Taylor expansion 

L(Q + e) = L(Q) + \L"{Q){e, e) + 0(e 3 ) 



Note here that s = s(t, x) is denoting a function rather than a number! This notation is 
traditional in the literature. 
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where L"(Q) : ff-^R) x i? 1 (R) — * R is some explicit positive semi-definite sym- 
metric bilinear form. The task is then to show that 

L"(Q)(e,e)>\\ef Hl 

when e is orthogonal to Q' . (An orthogonality condition of this sort is necessary; 
since L is translation invariant, we easily verify that L"(Q) must annihilate Q' .) 
This is a spectral gap condition on L"(Q), which can be viewed as a positive-definite 
self-adjoint operator, and can be established by spectral methods; the key ingredient 
needed is a uniqueness result that asserts that Q and its translates are the only 
minimisers of L. Details can be found [84^ . 

The theory of orbital stability for very general dispersive models has now been 
extensively developed, see e.g. [23], [7]. 

Another way to state the above results is that if a global solution u starts off 
close to the ground state curve S, then at later times one has the decomposition 

(4.9) u(t, x) = Q(x - x(t)) + e(t, x - x(t)) 

for some function x : R — ► R (which tracks the position of the soliton component of 
u) and some error term e, which is small in H . We have the freedom to impose one 
(non-degenerate) orthogonality condition of our choice on e, such as (e, = 0, 

by choosing x(t) appropriately. 

The question then arises as to what happens to the error e over time, or to the 
position x(t). We return to the model example of the rescaled soliton (14. 4|) . In this 
case we can take x(t) = xq + ct and e(t, x) — c 1 ^ p ~ 1 ' l Q{c 1 /' 2 x) — Q{x) (this e does 
not quite obey the above orthogonality condition, but this will not concern us). 
Thus we see in this case that e does not disperse to zero in any sense. However, 
we can hope to "quotient out" this scaling and obtain a decomposition (|4.9p which 
has a better error term s. Indeed, if we replace the ground state curve E by the 
ground state surface 

(4.10) E' := {c^-^QCc 1 /^ - xq)) : c> 0, x e R} D E 

and approximate u(t) by an element of E', we can obtain a more refined decompo- 
sition 

(4.11) u{t,x) = R{t,x) +e(t,x) 
where R is a soliton-like state 

(4.12) R(t, x) := c{t) x '^-^Q{c{tfl 2 {x - x{t))) 

for some modulation functions i:R-*R and c : R — > R + (with c always close to 
1), and e is small in H 1 (one can take ||£||hi = 0(a) if distjji (uq, E) < a). The 
point is now that by enlarging the dimension of the approximating surface from 1 
to 2, the error e is now allowed to enjoy two orthogonality conditions rather than 
just one. There are again several choices of which orthogonality conditions to pick 
(anything which is suitably "transverse" to E' will do); a typical set of choices is 

(4.13) / R{t,x)e(t,x) dx = / (x - x(t))R(t, x)e(t, x) dx = 0. 

JR JR 



18 If 

one replaces the power nonlinearity in 11.51 with a more general nonlincarity, then this 
positive definiteness of L"(Q) can fail. In that case one can in fact show that the solitons are not 
orbitally stable. 
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Now we can hope that with such a refined decomposition that the error e will 
disperse, especially in the neighbourhood of x(t), so that in the vicinity of the 
solition R(t,x), the solution u converges locally to R(t,x). Let us informally say 
that the soliton is asymptotically stable if we have a result of this form. Such 
stability results can be obtained for the completely integrable cases p = 2, 3 by using 
the inverse scattering methods of the previous section. For general sub-critical p, 
such results were first obtained by Pego and Weinstein [68] and by Mizumachi [66] , 
for perturbations of the ground state which were strongly localised (e.g. assuming 
exponential decay at infinity) . More recently, Martel and Merle [46] , [50] were able 
to consider more general perturbations which were only assumed to be small in the 
energy norm H , this generalisation is important for the purposes of understanding 
how a soliton will collide with another shallow broad soliton, which may have small 
energy but will not have strong localisation properties. In particular, they showed 

Theorem 4.2 (Asymptotic stability for sub-critical gKdV). Let the notation and 
assumptions be as in Theorem \4-l\ Then we have a decomposition of the form 
(|4.1ip . with c(t) — c+ constant and close to I, x(t) differ entiable with 

(4.14) lim x'(t) = c+, 

t — > + oo 

and the error term e obeying the local decay \\s(t)\\Hi( x >0t) ~ > as t — > +oo for 
any (3 > 0. 

Roughly speaking, this asserts that as t — > oo, the solution resolves into a soliton 
moving to the right at an asymptotically constant speed c + , plus an error term 
which is radiating to the left; this is of course consistent with the soliton resolution 
conjecture. (In the case p = 4, and with the additional scale-invariant assumption 
that u — Q is small in H 1 ^ , a refinement of this result was given by the author, 
asserting that e in fact converges asymptotically to a solution of the Airy equation 

The estimate (14. 14|) implies the asymptotic x(t) = c + t + o(t). It is not entirely 
clear what the nature of the o(t) error is; one might naively expect to obtain a 
refined asymptotic of the form x(t) = c + t + x + + o(l), but it turns out that by 
inverse scattering methods one can give an example in the p = 2 case in which one 
has the asymptotic x(t) = t + n^/logt + o(v / TogT) for some k > 0. 

We now sketch the ideas used to prove Theorem l4.2l The first step is to pass from 
(|1.5|) . which is an equation describing the dynamics of u, to equations describing 
the dynamics of e, x(t), and c(t). This can be done by algebraic manipulation^. 
Indeed, if one substitutes (|4.11jl . (I4.12| into (II. 5|) . one eventually obtains the equa- 
tion 

(4-15) e t + e xxx + {pBP^e):, = F x[t)At ^ x , {t) ^ {t) + N(e, R) 

for e where the forcing term F x u\ c u\ x i{t),c'{t) (caused by changes in the modulation 
parameters) is the explicit smooth function 

(4.16) F x{t)tC(t)tX , {t)tC , {t) := --^(-^ + (x- x(t))R x ) + {x'(t) - c(t))R x 



^As always, we ignore the analytic issues of how to justify all the formal computations in 
the case when u is low regularity; this can be done by standard (and boring) regularisation and 
limiting arguments. 
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and N(e, R) (caused by self-interactions of the radiation term e) is the nonlinearity 

N(e, R) := ((R + e f - R p - pRP^s)^ 

As for the evolution of x(t) and c(i), one can differentiate (|4.13|) to obtain a 2 x 2 
linear system of equations (known as the modulation equations) expressing the 
evolution x'(t) and c'(t) of the modulation parameters in terms of various integrals 
involving R, e and its derivatives. The exact form of these modulation equations is 
not important for our purposes; the only thing which matters is the type of control 
that one gets on x'(t) and c'(t). By comparison with the soliton solutions (|4.4p 
one expects x'(t) to be close to 1, and c'(t) to be close to 0. For most choices of 
orthogonality conditions, the degree of this closeness will only be linear in e. But 
if one uses the specific orthogonality conditions (|4.13p . it turns out that there are 
particular cancellations which allow the error here to be quadratic in £, at least as 
regards the variation of the scale parameter c(t). Indeed, one can show after some 
computation (exploiting the exponential decay of R and its derivatives away from 
x(t)) that 

(4.17) \c'{t)\ + \x'(t) - 1| 2 < f e 2 (t,x)e-\ x - x W dx. 

This is a rather strong estimate; it asserts that the error term e only has a lin- 
ear influence on the velocity, and a quadratic influence on the change in scale, 
and only when a significant portion of the mass of e is stationed near the soliton. 
These bounds are particularly useful in controlling the size of the forcing term 

Fx{t),c(t),x'(t),c'{t)- 

The right-hand side of (|4.15j) now consists primarily of terms which behave 
quadratically or higher in e. This raises the hope that one can use perturbation 
theory to approximate the evolution here by that of the linearised equation e t + 
£xxx + (pR p ~ 1 £)x = 0. (There is still one term, namely the drift term (x'(t) — 
c(t))R x va (|4.16p . in the right-hand side which exhibits linear behaviour, but this 
term only causes a translation in e and is thus be manageable.) To do this, we neecffil 
to somehow exploit the fact that the linearised equation is trying to propagate e to 
the left, while the soliton is moving to the right. 

One strategy to achieve this is via an understanding of the linearised equation 
£t+£xxx+(pR p ~ 1 £)x — 0. After some rescaling one can replace R with Q here. If one 
works in suitably weighted spaces, then one can use spectral theory to obtain good 
decay properties for this evolution which can be used to neglect the nonlinear terms 
and recover asymptotic stability in the case of rapidly decreasing perturbations. 
However, this approach seems to become very delicate in the case of perturbations 
in the energy space. 

One particularly elegant way to achieve control on the error e for perturbations 
in the energy space is via virial identities, as carried out in [50]. Let us motivate 
these identities in the simple model case of the Airy equation (|1.3[) . This equation 
has a conserved mass, indeed one quickly computes using (| 1 . 3[) and integration by 
parts that 




20 If 

we do not exploit this fact, then our control on the dispersive effects of the linearised 
equation is too weak; we can only hope to obtain decay of 0(t~ 1 /' i ) on e at best, which is 
insufficient to allow us to neglect the quadratic nonlinearity terms. 
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To affirm the intuition that the mass of u should be propagating leftward, let us now 
introduce the virial quantity J R xu 2 , which one can think of as the mean position 
of u. We compute 

dt I xv? = I 2xuut 
Jr jr 

2xUUxxx 

R 



d t / {x-x(t))u 2 < -c\\u\\ 2 



(4.18) dt / {x-x{t))e 2 <-c\\e\\ z mm 



R 

3ul. 

R 

In particular, we see that J R xu 2 is a decreasing function of time, which is a quan- 
titative realisation of the intuition of leftwards propagation. If we instead replace 
x by x — x (£), we get even faster decay: 

d t I (x-x(t))u 2 = - I 3ul-x'(t) I u 2 . 
Jr Jr Jr 

In particular, if x'(t) > 1 (which is the situation we are in above), we have 

Iffi(R) 

'R 

for some c > 0. 

It turns out that one can do the same sort of thing for (|4.15|) . Indeed, one can 
show (after lengthy computations) that (formally, at least) we have 

.1 —.Mill . " ' _ 

Ir 

for some c > 0. This estimate strongly suggests that e will move to the left of 
the soliton over time. Unfortunately, this "global" virial identity cannot be used 
directly in the above analysis, because the integral on the left-hand side may be 
divergent due to lack of spatial decay on e. However, this can be rectified by the 
usual trick of localising the weight x—x(t). Indeed, one can show that for sufficiently 
large A > 1, we have the "local" virial identity 

d t f ^ A (x-x(t))s 2 < -c f (£ 2 + e 2) e -l— (*)IM 
Jr Jr 
for some bounded increasing function ty(x — x(t)) which equals x — x(t) for \x — 
x (t)\ < A, and is of magnitude 0(A) throughout. Thus the quantity J r ^a(x — 
x(t))e 2 is monotone decreasing, while also being controlled by A times the mass. If 
we then integrate this in time we obtain an important spacetime bound 

(4.19) f f [e 2 +e 2 x )e-\ x - x ^' A dxdt < An. 

Jr Jr 

This is the first indication of dispersion away from the soliton; it asserts that the 
radiation term e cannot linger near the soliton x(t) for extended periods of time. 

This estimate, combined with (|4.17p . is already enough to demonstrate conver- 
gence of the scale parameter c(t) to an asymptotic limit c+. It also shows that the 
forcing term in (|4. 15|) decays quite quickly in time; in particular, the quadratic na- 
ture of the nonlinearity shows that it decays integrably in time, with the exception 
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of the drift term (x'(t) — c(t))R x which can be dealt with by hand. Because of this, 
it is possible to use energy estimates to conclude the full strength of Theorem 14.21 
We now describe an alternate approach to asymptotic stability, also due to Martel 
and Merle [46) , which uses a more sophisticated and general strategy which has since 
shown to be useful for many other equations, including critical equations. The basic 
strategy is to use the compactness- and- contradiction method, which we informally 
summarise as follows. 

(1) Suppose we wish to show some asymptotic property of a solution u(t) as 
t — > +oo. We assume for contradiction that this property does not occur. 

(2) By using weak compactness, we then extract a sequence t n of times going to 
infinity in which (suitably normalised versions) of the state u(t n ) are weakly 
convergent in some sense, but which violate the property in some quanti- 
tative manner. In particular, (suitably normalised versions of) u(t + t n ) 
should converge weakly to some asymptotic solution (t) of the original 
equation (now defined for all times t € R), which continues to violate the 
desired property. 

(3) By using the dispersive properties of the equation, show that the asymptotic 
solution Uoo(t) obeys some strong compactness properties, or equivalently 
that the evolution t t— > itoc(^) is almost periodic in some strong topology. 
(At this point Uoa is behaving somewhat like the dispersive analogue of a 
solution to an elliptic PDE or variational problem.) 

(4) Using more dispersive properties of the equation, upgrade the strong com- 
pactness to obtain further regularity and decay of the solution. (This step 
is roughly analogous to the exploitation of elliptic regularity in the theory 
of elliptic PDE.) 

(5) Establish a Liouville theorem or rigidity theorem, that the only solutions 
close to solitons which exhibit strong compactness, regularity, and decay 
properties are the solitons itself. This is the most difficult step, and often 
requires full use of the conservation laws and monotonicity formulae of the 
equation. (This is analogous to Liouville theorems in elliptic PDE, the 
most famous of which is the assertion that the only bounded holomorphic 
or harmonic functions on C or R d are the constants.) 

(6) We conclude that is itself a soliton, which we then combine with the 
fact that it violates the required property to obtain a contradiction. 

The compactness-and-contradiction method is extremely powerful in analysing 
many nonlinear parabolic and dispersive equations, for instance a variant of this 
method for Ricci flow also plays a crucial role in Perelman's recent proof of the 
Poincare conjecture. Another variant of this method is also very useful in estab- 
lishing large data global well-posedness results for critical equations, though we will 
not discuss this topic further here. The one drawback of the method is that, by 
being indirect and relying so strongly on compactness methods, it does not easily 
provide any sort of quantitative bound in its conclusions, in contrast to the previous 
arguments used to prove Theorem l4.21 which were direct and easily provide explicit 
bounds. 

Let us now sketch how this method is applied to give a new proof of Theorem l4.2l 
Actually we will just prove the slightly weaker claim that the translated radiation 
terms e(t, x — x{t)) converges weakly in iJ : (R) to zero as t — > +oo; note this weak 
convergence implies for instance that s(t,x — x(t)) converges locally uniformly to 
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zero, and so the radiation term eventually vacates the neighbourhood of the soliton. 
One can upgrade this convergence to obtain results closer in strength in Theorem 
14.21 but we will not do so here. 

To prove this weak convergence claim, we use the compactness-and-contradiction 
method. Suppose for contradiction that e(i, x — x(t)) does not converge weakly to 
H X (R) as t -> +00. Since e is bounded in H , weak compactness then shows 
that there exists a sequence of times t n — > 00 such that e(t n , x — x(t n )) converges 
weakly in H 1 to some non-trivial limit £00(0,2;); one can also assume that c(t n ) 
converges to some limit c + . Due to some weak continuity properties of the gKdV 
flow (which can be proven by harmonic analysis methods) one can then show that 
u(t + t n ,x + x(t n )) converges weakly (and locally in time) to some limiting solution 
Uooit, x) = Roa(t, x) + £oo{t, x) , where Roo and £00 obey similar estimates to R and 
£, and Roo is defined using some modulation parameters Coo(i) and Xoo(i). 

The normalised radiation terms £oo(t,x — x(t)) stay bounded in H . By the 
Rellich compactness theorem, this means that they are locally precompact in L 2 , 
i.e. their restriction to any compact spatial interval I lies in a compact subset of 
L 2 (I). We now assert that these terms are in fact globally precompact in L 2 . This 
is equivalent to asserting that for any S > 0, we must have some radius R such that 
we have very little mass on the left, 



We briefly sketch why one would expect these claims to be true. Suppose that 
(I4.2ip failed, then a non-zero portion of the mass of £oo at some time would be far 
to the right of the soliton. Returning to the original solution, we see that there 
exist arbitrarily large times t for which a significant portion of the mass of e(t) is 
to the right of x(t). Now we evolve backwards in time, back to time 0. Away from 
the soliton, mass has a tendency to move leftwards as one goes forwards in time, 
and thus rightwards as one goes backwards in time. One can make this precise (by 
using crude forms of the local virial identity alluded to before), and conclude that 
at time 0, a significant portion of the mass of e(0) is to the right of x(t). But t can 
be arbitrarily large, and so x(t) can be arbitrarily large also (recall that x'if) stays 
close to 1). This contradicts the monotone convergence theorem, and so (|4.21[) . 

The proof of (|4.20[) is similar; if (|4.20|) failed, then at some point a non-zero 
portion of the mass of £oo lies far to the left of the soliton, and thus we have strictly 
less than M(eoo) of the mass of £oo near or to the right of the soliton. By the above 
discussion, we see that we have strictly less than M(£oo) of the mass of e(t) near 
or to the right of x(t) for a sequence of arbitrarily large times t. But by using local 
virial-type identities to control the propagation of mass, this loss of mass to the left 
is irreversible, and in fact we have strictly less than M(soo) of the mass of e(t) near 
or to the right of x(t) for all sufficiently large times t. But then it is not possible 
for e(t + i n , x + x(t n )) to converge weakly to £oo(t, x), a contradiction. 

It turns out that one can upgrade the bounds (|4. 20(1 . (|4.2ip significantly, to 
obtain a pointwisc uniform exponential decay estimate of the form 



(4.20) 




and very little mass on the right 



(4.21) 




(4.22) 



c OO 



c\x—x{t) I 
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for some c > 0. This is established by a long-time analysis of (14.15|) and exploits the 
fact that the fundamental solution to the Airy equation (|1.3|) decays exponentially 
fast in the rightwards direction. This uniformity estimate is crucial in what follows. 

It then remains to establish the Liouville theorem that if Eoo is a sufficiently 
small (in H 1 ) solution to a nonlinear equation (14. 1 5[) with £ca(t, x — x(t)) compact 
in L 2 , then must vanish. To prove this, we first use another compactness-and- 
contradiction argument in order to eliminate the nonlinear terms in (|4. 15|1 . If the 
claim failed, then we could find a sequence of solutions e„ to (|4.15[) which converged 
to zero in H 1 norm as n — > oo, and were each compact in L 2 , but were non-zero. 
Normalising each e n by its H 1 norm, we see from from the uniform estimate (|4.22|) 
that the resulting sequence is still compact in L 2 . Thus we can take a limit and 
obtain a nontrivial solution e to the linearised equation 

£t + £xxx + {pR p ~ 1 £)x = a(t)R x 

for some scalar quantity a(t), which stays compact in L 2 and bounded in H 1 (and 
obeys the orthogonality conditions (|4.13f) ). The task is thus to show that there 
is no such solution other than the trivial solution e = 0, which will establish the 
Liouville theorem and thus Theorem 14.21 

At this point, one can now use the global virial estimate (|4. 18[1 . which is valid 
here due to the exponential decay of e. If e is non-trivial, it has an H 1 norm 
bounded away from zero, which in conjunction with the L 2 compactness shows 
that the right-hand side of (|4.18[) is negative and bounded away from zero. But 
this forces J R (x — x(t))e 2 to go to — oo, which contradicts the exponential decay of 
e. This finally finishes the argument. (An alternate proof of this "linear Liouville 
theorem" was also recently established in [44].) 

We conclude with a number of miscellaneous remarks. 

Remark 4.3. This proof was significantly more complicated than the direct proof, 
but the underlying strategy is much more powerful: it uses compactness methods to 
strip away all the inessential portions of the dynamics, leaving a very smooth and 
localised solution to which global estimates can be applied. This dispenses with 
the need for any cutoffs in space or frequency, which can significantly complicate 
the analysis. 

Remark 4.4. These arguments have recently been simplified and generalised further 
(to handle arbitrary power-type nonlinearities) in [52] . Interestingly, this result 
does not require any spectral hypotheses on the linearised problem. Also, some 
sharper asymptotics for special nonlinearities (in particular the p — 4 case) have 
been obtained in [80], [51] . 

Remark 4.5. The above arguments crucially rely on the tendency of gKdV evolu- 
tions to propagate non-soliton mass in one direction, while the soliton itself moves 
in another direction, leading to some crucial monotonicity formulae. These meth- 
ods can be extended to some other models [17], [IS], [BB]. For NLS models however, 
there are only weak analogues of these formulae (see [58]), although for some special 
nonlinearities (in which spectral hypotheses on the linearised operator are assumed) 
asymptotic stability for such equations can be recovered [8] , [69] , [70] . The situation 
is particularly well understood in the radial case, in which the soliton remains tied 
to the spatial origin and so the degeneracies in stability associated with translation 
invariance are eliminated. For a survey of these issues see [15] ■ 
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Remark 4.6. In a sort of converse to asymptotic stability (analogous to the existence 
of wave operators in scattering theory), it is also possible to construct solutions with 
prescribed asymptotic behaviour of the type described above; see [T2], [13], [14] . 

5. The critical case 

We now discuss the more difficult mass-critical case p = 5, in which the scale 
invariance now plays a much more delicate role. The details are rather technical 
and we shall only paraphrase or sketch the key arguments here, referring the reader 
to the original papers for full details. For further surveys of the results in this 
section, see [55], [56], [82] , 

The new difficulty in the critical case can be seen by considering the one- 
parameter family of soliton states Q c (x) := c 1 ^ p ^ 1 - ) Q(c 1 / 2 a:). A short computa- 
tion shows that M(Q C ) = c*=t - *M(Q) and E(Q C ) = c^ + ^E{Q). If p ^ 5, the 
exponents here are non-zero (and M(Q) and E{Q) are also non-zero), and so the 
conservation of mass or energy prohibits a solution which starts near Q from ending 
up near Q c for any c ^ 1. But in the critical case p = 5, we have M(Q C ) = M(Q), 
while one can compute that E(Q C ) = E(Q) = 0. Thus, the mass and energy conser- 
vation laws do not prohibit the possibility of drift from Q to Q c , or more generally 
along the entire ground state surface (|4.10[) . on which the mass is always M (Q) and 
the energy is always 0. Indeed, numerics [6] suggest that solutions u starting near 
a ground state will increase their scale parameter c to infinity in finite time, thus 
leading to finite time blowup (thus, for instance lim^y- ||u(t)||#i = +oo for some 
finite time T). An inspection of the linearised operator also supports the possibility 
of drift to increasingly finer scales. 

Henceforth we fix p = 5. As discussed earlier, the sharp Gagliardo-Nirenberg 
inequality of Weinstein[83] allows one to show that no blowup occurs as long as 
M(u) < M(Q), in particular there are initial data arbitrarily close to the ground 
state for which one has global existence. A refinement of this analysis also allows 
one to consider the situatior0 in which M(Q) < M{u) < M(Q) + a for some 
small a > 0, as long as the energy E(u) is negative (this situation can again occur 
arbitrarily close to the ground state Q, as long as the mass is strictly greater than 
M(Q)). In this case, we can again (up to a harmless change of sign, u i— > —u) obtain 
a decomposition of the form (|4.1ip over the lifespan of the solution. However, a 
key difference in the critical case is that the scale parameter c(t) can go to infinity 
in finite time. 

Henceforth we assume M{Q) < M(u) < M(Q) + a for sufficiently small a, 
and also E(u) < 0. It will be convenient to make the error e (|4.11[) dimensionlcss 
(i.e. invariant under the scaling symmetry) by replacing 14. 1 1 [1 with the equivalent 
decomposition 

u{t, x) = citf'^QicitY^ix - x{t))) + e{t, citf^ix - x{t)))]. 

As before, one can select two orthogonality conditions on e; it turns out to be con- 
venient to require (e(t),Q 3 ) = (e(t),Q') = 0. In that case one can use Weinstein's 
analysis to show (after replacing u with — u if necessary) that e(t) is small in H 1 



21 The case M(u) = M(Q) was subsequently treated in |48l . in which blowup was shown to be 
impossible. 



WHY ARE SOLITONS STABLE? 



25 



(see e.g. [60] for detail^l). In particular we see that ||u(i)||#i ~ c(i) 1 / 2 , and so 
blowup in the H 1 norm is equivalent to c(t) going to infinity. 

Suppose for the moment that the solution u existed globally in time, with c(t) 
bounded both above and below, and furthermore that s(t) ranged in a compact 
subset of L 2 (so in particular, one has bounds such as (|4.20|) . (|4.2ip ). Then the 
same Liouville theorem analysis used in the subcritical case can be used to show 
that e = 0, which would of course contradict the assumption that M(u) is strictly 
greater than M(Q); see [15] for details. By repeating the rest of the subcritical 
analysis, this is already enough to deduce asymptotic stability in the case when 
c(t) is bounded both above and below. However, this statement is vacuous due to 
the results in [60], which in fact show that c(t) — > oo as t — ► T, where < T < oo 
is the maximal time of existence. 

We sketch the proof of this result as follows. Once again we use a compactness 
and contradiction argument. If the above claim failed, then one could find a se- 
quence u = u n of solutions, each with some finite or infinite lifespan < T n < +oo, 
with mass M(Q) < M(u n ) < M(Q) + o(l) and E(u n ) < 0, such that the velocity 
function c n {t) stayed bounded in t for each n (though with a bound depending 
on n). For each n, we consider the quantity c n ,* := lim inf c n (t). Then c n .* is 
bounded; from the negative conserved energy one can also show c n ^ to be non-zero. 
Thus we can find a time t n for each n such that c n (t n ) is very close to c Mi », and 
that c n (t) is either close to or larger than c raj * for all t > t n . By rescaling and 
translating in time if necessary we may take c n {t n ) — 1 and t n = 0. 

For each n, we see from construction that c n (t) is always greater than or close 
to 1, and returns for some infinite sequence of times i n ,m — > T n to be close to 
1. This situation is similar to the previous situation "c„(<) bounded above and 
below", except that we now allow c n to oscillate between being close to 1 and 
being extremely large. It turns out that with enough care, one can use local mass 
propagation estimates much as before to obtain exponential decay similar to (|4.22|) 
at and near the times t n ^ ni or more precisely for a limiting error profile e n>oa 
obtained as a weak limit of e n (t + t n ^ m ,x). In particular, this places the limiting 
error profile e ni0 o (and thus the limiting solution profile u„ i00 ) in L . This allows one 
to deploy a final conservation law, namely the mean J R M n ,oo(i)- The conservation 
of this quantity can be used to show that the limiting velocity parameter Coo(i) is 
bounded both above and below, at which point the Liouville theorem ensures tht 
£n,oo must vanish. But this turns out to be incompatible with the negative energy 
hypothesis. This contradiction establishes the desired claim that one has blowup 
either at finite or at infinite times. 

It turns out that one can analyse these solutions further. For this it turns out 
to be convenient to change the orthogonality conditions slightly, so that one now 
requires 

(s(t), | + vQ y ) = (e(t), y(| + vQ y )) = 

where y denotes the spatial variable. (The expression ^ + yQ y can be viewed as 
the infinitesimal scaling vector field at Q.) These conditions are convenient for 
applying virial identities. It is also convenient to introduce a dimcnsionless time 



22 In the paper 1601 and the other papers cited here, the wavelength parameter A(t) := c(t) -1 / 2 
is used rather than the scale parameter c(t), but this of course makes only a minor notational 
difference to the argument. 



26 



TERENCE TAO 



variable s defined by the ODE 

(5.1) != c3/2 W' 

One can show that regardless of whether T is finite or infinite, that s — > +00 as 
t — > T. The reason for these choices of coordinates is that the error e = e(s, y) now 
obeys an analytically tractable PDE, which takes the form 

e s = (Ls) y + N(e,Q) 

where L is the linear operator L := —d xx + l — 5Q 4 and N is a nonlinear (and slightly 
non-local) expression which consists of terms which are quadratic and higher in e, 
and which are largely localised in space (due to the fact that most terms involve at 
least one factor of the localised function Q). The finer structure of N is important 
for certain computations, but for simplicity we will not delve into the explicit form 
of N here. 

If c is now viewed as a function of s rather than t, then from the preceding 
analysis we have c(s) — ► 00 as s — ► 00. However, this divergence to infinity could 
potentially be quite oscillatory; there is no a priori reason why c(s) should be 
monotone. Also we do not have a priori knowledge as to the rate of growth of c in 
s. 

Suppose however that we had a time interval [si, S2] (which could potentially be 
very long) in which c(s) varies between c(si) and c(s2) = l.lc(si) (say), thus there 
is a slight focusing effect along this interval. Suppose also that c(s) > c(si) for 
all times s > S2, and also that we have an exponential localisation such as (|4.22|) 
on the time interval [si,S2] (actually, for this argument, we only need exponential 
localisation to the left of the soliton). It turns out that there is no solution of this 
form (assuming our hypotheses of negative energy, and mass close to that of the 
ground state, of course). This key fact is established in [49], and we sketch the 
proof as follows. We may rescale so that c(si) = 1, thus c(s) ~ 1 for s £ [31,82]- 
The localisation places e in L 1 (to the left, at least). We introduce a quantity J 
that measures the amount of L 1 mass of e that is to the left of the soliton; the 
precise definition of J is 

(5.2) J(s) := / e(s,y)( [ + {% + zQ z ) dz) dy - -( [ Q) 2 . 



2 ' " 4 . 

It turns out that J obeys a differential equation of the form 

^(c~ 1/4 J) =-2c-^ 4 J eQ + Oic- 1 ^ J e 2 e-^ 2 / 2 dy), 

which reflects the fundamental fact that the non-soliton portion of mass tends to 
propagate to the left. On the other hand, the conservation of mass and energy 
eventually gives us a lower bound 



1,„, 1 



eQ > -\E(u)\ + - I ef, dy + 0( I e z e~^ " dy) 



and so on the interval [si, 82] (where c lies between 1 and 1.1) we have 
(5.3) ^( C ' V4 J ) + ai J £ l d V + 0-l|£(u)| <0{J £ 2 e-^ 2 / 2 dy) 

(say). On the other hand, on [s\, S2] the quantity c increases from 1 to 1.1, which 
together with (|5.2[) tells us that c -1 / 4 J increases over this interval by at least some 
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absolute constant S > (recall from (|4.22|) that e is small in L 1 ). Thus if we 
integrate (|5.3|) over [si, S2] we arrive at a bound of the form 

1 + r I £ " v dy + lE(u)llS2 " Sl1 - ° ( / S2 / e2eHy|2/2 d »)- 

On the other hand, virial identity arguments (similar to those used to derive (I4.19p ) 
combined with the hypothesis that c(s) is comparable to 1 on [si,S2], allow us to 
establish a bound of the form 

J* J e 2 e~\y\ 2 ' 2 dy < Ca x l\\ + jT J s 2 y dy + \E(u)\\s 2 - 

where a :— M(u) — M(Q). For a sufficiently small, this leads to the desired 
contradiction. 

By combining the above result with the Liouville theorem discussed earlier, and 
another compactness-and-contradiction argument, it was shown in [49 that there 
existed a sequence of times s m — > 00 such that e(s m ) converged weakly (in H 1 ) 
to zero. Indeed, since we know c(s) — > 00 as s — > 00 and is continuous in s, 
we can define s m to be the last time for which c(s m ) = (l.l) m . If e(s m ) did 
not converge weakly to zero, then (after passing to a subsequence if necessary) it 
would converge to some other limiting error profile £00 (0), which can be viewed 
as the error at time zero for some limit solution profile u oc (s), with its attendant 
velocity parameter Coo(s), which is bounded from below by Coo(0) for all s > by 
construction. A careful application of local mass propagation inequalities (see [49] 
for details) allows one to establish exponential decay on £oo(s) on the left. If Coo(s) 
increases to l.lCoo(O) in finite time, one can then use the previous argument to 
obtain a contradiction. If instead Coo(s) ranged between Coo(0) and l.lCoo(O) for all 
s, then one can use further local mass propagation to get some decay on the right 
as well. The Liouville theorem then would force to vanish, again leading to a 
contradiction. Thus we obtain the weak convergence of e(s m ) to as claimed. 

A further compactness-and-contradiction argument in 49J allows one to strengthen 
this claim, to assert that e(s) converges weakly to for all times s — > 00, not just a 
subsequence s m — ► 00 of times. For if this were not the case, one could find another 
sequence s' m — > 00 of times for which e(s' m ) was converging to a non-zero limit 
e'aciO), associated to a limiting solution profile u'^. But because any sufficiently 
large s' m can be placed between a pair s m i, s m i+\ of times where e is close to zero 
(in the weak topology), it is possible to use local mass conservation to show that 
Miu'oo) < M(Q); meanwhile, the energy E^'^) can be shown by limiting argu- 
ments to be non-positive. But this, together with the orthogonality conditions on 
e'oo, forces to vanish, a contradiction. 

Finally, in [47j , it was shown that solutions of the above form in fact blow up in 
finite time (thus giving theoretical confirmation of the numerical blowups observed 
for instance in [5]), provided that one make an additional assumption of decay on 
the right (e.g. it would suffice to have J °° it(0, x) 2 x 6+s dx < 00 for some S > 0). 

We very briefly sketch the main ideas as follows. In view of the relation (|5.ip . 
some numerology shows that finite time blowup will eventuate if we can show that 
4- log c(s) grows (on average at least) like c(s)~ 3 / 2+s for some 5 > 0. In fact, in 
[4T] it is shown that 

(5.4) log c(s) > SlEiuMs)- 1 



28 



TERENCE TAO 



"on the average", for some absolute constant 5 > 0, which leads to finite time 
blowup for negative energy data. 

A calculation of the dynamics of c(s) reveals an equation of the form 

f g loge( S ) = 2 J e L((| + yQ y ) y ) dy + 0(J eV^ 100 dy). 

The error term here is controllable by virial identities. But the quantity J eL((® + 
yQy)y) dy is somewhat oscillatory and is not easy to control directly. To avoid this 
problem, the arguments in [47) introduce a new decomposition 

u(t, x) = c'(i) 1/4 [Q(cW /2 0E " x'{t))) + e'(t, c'(i) 1/2 (x - x'{t)))] 
where e' obeys some rather different orthogonality conditions, namely 

/(/" | + Qz dz)e'(y) dy = J y(| + Q y )e' dy = 0. 

Because ^ + Q z dz is non-zero, such orthogonality conditions are only rea- 
sonable when u has sufficient decay on the right. Fortunately, the hypothesis of 
polynomial decay on the right at time zero, together with some local mass propa- 
gation estimates, turn out to give enough decay to make this decomposition viable 
(see [17] for details). We also have an associated rescaled time variable s' defined by 
ds'/dt = d{tf' 2 . The point of perfo rming this decomposition is that the equation 
for c'(s') is simpler than that for c(s): 

A logcV) =2je'Q + 0{J (s'fe-^ wo dy). 

Furthermore, one can show that c and d are comparable, as are e and e' , in various 
technical senses. In particular, the quantities J e'Q and J eQ can be related to 
each other modulo acceptable errors. On the other hand, mass and energy con- 
servation considerations eventually let one establish a lower bound on J sQ of the 
form S\E(u)\/c(s), modulo acceptable error terms. The claim follows. 

Remark 5.1. The above analysis in fact gives some more explicit upper and lower 
bounds on the blowup rate (i.e. the behaviour of c(t) as t approaches T); see [47] 
for details. 

Remark 5.2. For the analogous problem for mass-critical NLS, blowup from neg- 
ative energy data can be obtained easily from a virial identity argument, at least 
when the data is localised in space; see [22], while explicit blowup solutions can 
also be constructed via the pseudo-conformal symmetry of this equation. However, 
the corresponding virial identities for gKdV equation, while leading to important 
estimates such as (14.19[) . seem to be unable to coerce blowup just by themselves. 

Remark 5.3. The situation for the supercritical equation p > 5 is still poorly un- 
derstood; due to some crucial changes of sign, many of the methods used above fail 
completely. Numerics such as those in [6] continue to suggest finite time blowup 
starting near soliton initial data in this case, but the dynamics are known to be 
unstable [7J and so it is unlikely that a controlled blowup of the type seen above 
will hold, at least for generic data. 
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6. Further developments 

We now briefly survey more recent developments regarding stability of multisoli- 
tons, and on collisions between solitons. Due to the vast explosion in activity in 
these areas (for a wide variety of dispersive models), we will not be able to give 
a comprehensive bibliography here, instead focusing on a representative subset of 
results. 

There has been some progress in understanding the stability aspects for multi- 
solitons - superpositions of two or more solitons. As the underlying equations are 
non-linear, constructing these solutions is non-trivial, and indeed existence of such 
solutions is usually established simultaneously with stability results. When the 
solitons are far apart and receding from each other, one is in a perturbative regime 
and can in some cases glue together the stability theory for single solitons to create 
multisolitons. See [57] for an instance of this for subcritical gKdV multisolitons, 
and [70], [35], [SS] for some results for subcritical NLS multisolitons. 

More difficult is the question of what happens when two solitons collide. Re- 
cently there has been some work on the collision between fast thin solitons and 
slow broad solitons [53], [54] ; the situation is still perturbative, but there are no- 
ticeable nonlinear effects, such as a shift in position in the fast soliton caused by 
nonlinear interaction with the slow soliton. Somewhat similarly in spirit, there 
has been some recent work in [27], [26] investigating the collision between a fast 
soliton and a localised potential (such as a delta function potential). The fully non- 
perturbative situation of two large slow solitons colliding is however still beyond 
current technology. 

In supercritical cases, solitons are generally unstable. However in some cases the 
number of unstable directions is finite, and so finite dimensional stable manifolds 
can (in principle) be constructed. This turns out to be a rather delicate issue 
(relying, among other things, on strong control on the spectrum of the linearised 
operator, which can contain some non-trivial resonances); see [33] . |35] for recent 
developments. 

There has also been many recent papers on the blowup in the neighbourhood of 
a soliton to a scale-invariant evolution equation; these results do not quite follow 
exactly the same pattern as the results for critical gKdV mentioned in the previous 
section, but certainly share many of the same ingredients. For instance, see [61] . 
[62] . [63] for some results relating to the mass-critical NLS. For the energy-critical 
non-linear wave equation or wave maps equation, there are some slightly different 
approaches to create blowup [34], [37], [71]; see [72] for a recent survey. 

The compactness-and-contradiction approach has also been applied to derive 
the absence of blowup (i.e. global existence) for critical equations in the defocusing 
case, or in focusing cases in which the solution is "smaller" in mass or energy than 
that of the ground state. See [2g]. [5T]. [5U| 
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